Written
Corpus,
Language Type:
Bilingual
Languages:
English French
Availability:
Freely Available
License:
Unspecified
Size:
55K sentence pairs
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Investigating Catastrophic Forgetting During Continual Training for Neural Machine Translation
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shuhao Gu | wmt19 data | /N |
Documentation:
I don't know.
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German
Availability:
Freely Available
License:
Unspecified
Size:
4.5M en-de + 0.6M en-fr sentence pairs
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Investigating Catastrophic Forgetting During Continual Training for Neural Machine Translation
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shuhao Gu | WMT14 Data | /N |
Documentation:
I don't know.
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Japanese
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike 4.0 International License
Size:
765 MByte
Production Status:
Use:
Information Extraction, Information Retrieval
- Paper title: Embedding Meta-Textual Information for Improved Learning to Rank
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shigehiko Schamoni | MetaCLIR: Meta-Textual Information for Cross-lingual Information Retrieval | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
License:
OpenSource
Size:
3 GByte
Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
- Paper title: Mama/Papa, Is this Text for Me?
- Paper track: Short paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Rashedur Rahman | Age prediction from text | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German
Availability:
Freely Available
License:
CreativeCommons
Size:
31014 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Supervised Visual Attention for Multimodal Neural Machine Translation
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tetsuro Nishihara | Multi30k | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Italian Portuguese Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
Multilingual word embeddings in 30 languages and 110 bilingual dictionaries
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: A Locally Linear Procedure for Word Translation
- Paper track: Short paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Soham Dan | MUSE | /N |
Documentation:
https://github.com/facebookresearch/MUSE/blob/master/README.md
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese English French Japanese Korean Russian
Availability:
Freely Available
License:
Size:
5000 sentences
Production Status:
Newly created-in progress
Use:
Analysis of cross-linguistic morphosyntactic divergences
- Paper title: Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences
- Paper track: Long/Resources and Evaluation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dmitry Nikolaev | Aligned sub-corpus of Parallel Universal Dependencies | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Translationese as a Language in "Multilingual" NMT
- Paper track: Long/Machine Translation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Parker Riley | WMT data | /N |
Documentation:
None
Speech/Written
Treebank,
Language Type:
Bilingual
Languages:
French North African Arabic
Availability:
Freely Available
License:
CC-BY-SA
Size:
1500 sentences
Production Status:
Newly created-finished
Use:
Parsing and Tagging
- Paper title: Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell
- Paper track: Long/Resources and Evaluation
- Paper status: Accept - LREC
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Djamé Seddah | Narabizi Treebank | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German Spanish Swedish
Availability:
Freely Available
License:
CreativeCommons
Size:
7 GByte
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
- Paper title: Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain
- Paper track: Long/Information Retrieval and Text Mining
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shadi Saleh | Extended CLEF eHealth 2013-2015 IR Test Collection | /N |
Documentation:
None




